Fault Isolation and Quick Recovery in Isolation File Systems
Abstract
Lanyue Lu presented isolation file systems, which provide fault isolation and quick recovery within a single file system. Because file systems are important data access interfaces in many environments, high availability is critical; however, a single fault can have a large-scale impact on the whole file system, such as a remount as read-only or a system crash. Lanyue explained how metadata corruption in one virtual machine disk image can affect multiple virtual machines that share the same hypervisor file system. Lanyue argued that modern file systems do not provide fine-grained fault isolation, so all the files within a file system share a single fault domain.
Similar resources
Fault Detection and Isolation of Multi-Agent Systems via Complex Laplacian
This paper studies the problem of fault detection and isolation (FDI) for multi-agent systems (MAS) via complex Laplacian subject to actuator faults. A planar formation of point agents is achieved using simple, linear interaction rules related to the complex Laplacian. The communication network is a directed yet connected graph with a fixed topology. The loss of symmetry in the...
Online Fault Detection and Isolation Method Based on Belief Rule Base for Industrial Gas Turbines
Real-time and accurate fault detection has attracted increasing attention with the growing demand for higher operational efficiency and safety of industrial gas turbines as complex engineering systems. Current methods based on condition monitoring data have drawbacks in using both expert knowledge and quantitative information for detecting faults. For this reason, this paper proposes...
Recovery Strategies for Linear Replication
Replicated systems are commonly used to provide highly available applications. In recent years, these systems have mostly been based on atomic broadcast protocols, and a wide range of solutions have been published. The use of these atomic broadcast-based protocols has also helped in developing recovery protocols that provide fault tolerance to replicated systems. However, this research has be...
A Recovery Model for Extended Real-Time Transactions
A central problem in the design of fault-tolerant real-time systems is that desirable fault-tolerance properties are usually realized by mechanisms that counteract real-time guarantees. A prominent example is the All-or-Nothing property (also known as failure atomicity) known from transactions. This property is normally realized by means of isolation and roll-back recovery. However, isolation ...
Monitoring, Predic and Fault Isolation in Dynamic
Diagnosis of dynamic physical systems is complex and requires close interaction of monitoring, fault generation and refinement, and prediction. We establish a methodology for model-based diagnosis of continuous systems in a qualitative reasoning framework. A temporal causal model capturing dynamic system behavior identifies faults from deviant measurements and predicts future system behavior e...
Publication date: 2013